CEA LIST's Participation at the CLEF CHiC 2013
نویسنده
چکیده
For our first participation to the CLEF CHiC Lab, we submitted runs to the multilingual ad-hoc and multilingual semantic enrichment tasks. Given the strong multilingual character of the evaluation corpus, the main objectives of the experiments were to test the efficiency of semantic topic expansion and consolidation based on Explicit Semantic Analysis (ESA) versions in different languages. Another objective was multilingual fusion of results obtained in the different languages of the corpus. ESA was adapted for the 10 languages that are best represented in the Europeana corpus. Wikipedia dumps from March 2012 were used for French and English and from March 2013 for the other languages. One problem that arises when modeling short documents, such as queries, with classical ESA vectors no information is available whether the concept is related to the entire topic only to a part of it. To overcome this problem, two adaptations of ESA (ESA-C) are Wikipedia concepts that are linked to the highest number of concepts from the original topic. In the ad-hoc task, ESA and ESA-C have two roles: to expand the topic with related concepts and to create consolidated topic models which contain the original topic words along with other related keywords. We submitted both monolingual and multilingual runs without topic expansion and using topic expansion and consolidation with classical ESA and ESA-C. The best results are obtained in a multilingual setting with no expansion and ESA-C topic consolidation. For the semantic enrichment task, we propose lists of related Wikipedia concepts using either a monolingual ranking or a voting scheme that surfaces related concepts that appear in the largest number of languages. Here, the best results are obtained in a monolingual ranking configuration that exploits ESA-C.
منابع مشابه
RSLIS/AAU at CHiC 2013
In this paper, we describe our participation in the interactive and adhoc tracks of CHiC 2013.
متن کاملCEA LIST's participation to the Scalable Concept Image Annotation task of ImageCLEF 2015
This paper describes our participation to the ImageCLEF 2015 scalable concept image annotation task. Our system is only based on visual features extracted with a deep Convolutional Neural Network (CNN). The network is trained with noisy web data corresponding to the concepts to detect in this task. We introduce a simple concept localization pipeline that provides the localization of the detecte...
متن کاملCEA LIST's Participation at MediaEval 2013 Placing Task
At MediaEval Placing Task 2013 [2], we focused on improving our last year’s participation in four directions: (1) exploit a larger geotagged dataset in order to improve the quality of a standard geolocation language model, (2) model machine tags, (3) estimate the geographicity of tags associated to geolocated photos and (4) exploit user cues in order complement language models whenever these la...
متن کاملCEA LIST's Participation to Visual Concept Detection Task of ImageCLEF 2011
This paper describes the CEA LIST participation in the ImageCLEF 2011 Photo Annotation challenge. This year, our motivation was to investigate the annotation performance by using provided Flickr-tags as additionnal information. First, we present an overview of our local and global visual features used in this work. Second, we present a new method, that we call ”Fuzzy-tfidf”, which takes into ac...
متن کاملUsing the Divergence Framework for Randomness: CHiC 2013 Lab Report
For this first participation to the CHiC Lab, we focused on understanding the challenges of working with a collection of cultural heritage objects with short textual descriptions and on how to fine-tune a set of weighting models from the probability models based on Divergence From Randomness to perform uniformly in monolingual and multilingual scenarios. The official runs submitted used PL2 as ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013